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We have detected the weak lensing signal induced by the cluster of galaxies 
MS 2053-04 (z = 0.58) from a two-colour mosaic of 6 HST WFPC2 images. 

The best fit singular isothermal sphere model to the observed tangential distortion 
yields an Einstein radius = 6'.'2 ± I'.'S, which corresponds to a velocity dispersion 
of 88611^39 km/s {Vim — 0.3, Qa = 0.0). This result is in good agreement with the 
observed velocity dispersion of 817 ± 80 km/s from cluster members. The observed 
average restframe mass-to-light ratio within a 1 h^^ Mpc radius aperture is 184 ± 
56 Mq /Lb0 . After correction for luminosity evolution to z = this value changes 
to 291±89±19 /150 Mq/Lbo (where the first error indicates the statistical uncertainty 
in the measurement of the mass-to-light ratio, and the second error is due to the 
uncertainty in luminosity evolution). 

MS 2053 is the third cluster we studied using mosaics of deep WFPC2 images. For 
all three clusters we find good agreement between dynamical and weak lensing velocity 
dispersions, in contrast to weak lensing studies based on single WFPC2 pointings on 
cluster cores. This result demonstrates the importance of wide field data. 

We have compared the ensemble averaged cluster profile to the predicted NFW 
profile, and find that a NFW profile can fit the observed lensing signal well. The 
best fit concentration parameter is found to be 0.79;|^q;^5 (68% confidence) times the 
predicted value from an open CDM model. 

The observed mass-to-light ratios of the clusters in our sample evolve with redshift, 
and are inconsistent with a constant, non-evolving, mass-to-light ratio at the 99% 
confidence level. The evolution is consistent with the results derived from the evolution 
of the fundamental plane of early type galaxies. The resulting average mass-to-light 
ratio for massive clusters at z = is found to be 239 ± 18 ± 9 h^o Mq/Lb©. 



1 INTRODUCTION 



Observations of high redshift clusters are valuable to test 
our current understanding of structure formation on cosmo- 
logical scales (e.g. Eke, Cole & Frenk 1996; Bahcall & Fan 
1998). In particular reliable mass estimates of these systems 
are important, as they provide strong constraints on cosmo- 
logical models. 



* Based on observations with the NASA/ESA Hubble Space Tele- 
scope obtained at the Space Telescope Science Institute, which is 
operated by the Association of Universities for Research in As- 
tronomy, Inc., under NASA contract NAS 5-26555 
t Hubble Fellow 



The small, systematic, distortion in the shapes of back- 
ground sources induced by massive structures, known as 
weak gravitational lensing, has proven to be a powerful 
method to measure the masses of clusters of galaxies (for 
an extensive review see Mellier 1999). The weak lensing ef- 
fect allows one to reconstruct the projected surface mass 
density (e.g. Kaiser & Squires 1993) or measure the mass, 
without having to rely on assumptions about the state or 
nature of the deflecting matter. However, for an accurate 
mass estimate high number densities of background galax- 
ies are needed, as well as a good estimate of their redshift 
distribution. 

Lensing studies of high redshift clusters {z > 0.5) are 
difficult because the lensing signal is low and most of the 
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signal comes from small, faint sources. These sources typ- 
ically have sizes which are comparable to the size of the 
PSF in ground based images. To extract the lensing signal 
from such observations large corrections are required. For 
these studies HST observations have great advantage over 
ground based observations because the background sources 
are much better resolved, resulting in a well calibrated weak 
lensing signal. 

In this paper we present the results of our weak lens- 
ing analysis of the z = 0.58 cluster of galaxies MS 2053-04. 
It is the third cluster of which we studied the mass distri- 
bution based on a deep two-colour mosaic of WFPC2 im- 
ages. The other two clusters that have been studied this 
way are CI 1358+62 {z = 0.33) (Hoekstra et al. 1998; HFKS 
hereafter), and MS 1054-03 (z = 0.83) (Hoekstra, Franx, & 
Kuijken 2000; HFK hereafter). All three clusters have been 
selected on the basis of their strong X-ray emission. 

MS 2053 was detected in the Einstein Medium Sensi- 
tivity Survey (Gioia & Luppino 1994). It is one of the few 
z > 0.5 clusters found in this survey, and of these high red- 
shift clusters it has the lowest X-ray luminosity. Its X-ray 
luminosity^] is L^{2 - 10 keV) = (7.9 ± 0.7) x 10*** h'f 
ergs/s (Henry 2000). The X-ray temperature measured by 
BeppoSAX is kT = 6.7t| j keV (Delia Ceca et al. 2000). A 
more accurate temperature of kT = 8.1^2 j been 
determined from ASCA observations (Henry 2000). 

Luppino & Gioia (1992) discovered a gravitationally 
lensed arc in deep images of MS 2053. The arc is located 
approximately 15 arcseconds from the Brightest Cluster 
Galaxy (BCG). Its redshift is still unknown. The cluster 
mass distribution has been studied previously through weak 
lensing by Clowe (1998) based on deep ground based images. 

We first present the results of the weak lensing analysis 
of MS 2053. In section 2 we briefly discuss the data, and 
in section 3 the object analysis is described. The cluster 
light distribution is examined in section 4. In section 5 we 
present the weak lensing signal and the reconstruction of the 
projected surface mass density. The mass and mass-to-light 
ratio inferred from our analysis are presented in section 6. 
In section 7 we present the combined results of a sample of 4 
clusters that have analysed and calibrated in a uniform way. 
We compare the weak lensing mass estimates to dynamical 
estimates. We also study the average mass profile of the 
clusters, as well as their mass-to-light ratios. 



2 DATA 

To study the cluster MS 2053 we use a mosaic of WFPC2 im- 
ages taken with the Hubble Space Telescope. Figure ^ shows 
the layout of the mosaic constructed from the 6 pointings 
of the telescope. The cluster has been observed in two pass- 
bands. Each pointing in each filter consists of three separate 
short exposures, which allows an effective rejection of cosmic 
rays. The total integration time per pointing was 3300s in 
the F606W filter, and 3200s in the F8UW filter. The reduc- 
tion is described in van Dokkum et al. (2001). For the weak 

t Throughout this paper we will use Hq = 50 hsokm/s/Mpc, 
f2m = 0.3 and Q/^ = 0. This gives a scale of 1" = 8.2h^Q kpc at 
the distance of MS 2053. 
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Figure 1. Layout of the observed field. The six pointings are 
indicated. The dashed circle indicates an aperture with a radius 
of 1 h^Q Mpc centered on the brightest cluster galaxy (BCG). The 
area covered by the observations is approximately 26.5 armin-^. 

lensing analysis we omit the data of the Planetary Camera 
because the data do not reach the same depth as the Wide 
Field Camera. The total area covered by the observations is 
approximately 26.5 arcmin^. 



3 OBJECT ANALYSIS 

The weak lensing analysis technique is based on that devel- 
oped by Kaiser, Squires, & Broadhurst (1995), and Luppino 
& Kaiser (1997), with a number of modifications which are 
described in detail in HFKS and HFK. We analyse each 
WFPC2 chip separately, and combine the object catalogs to 
a master catalog once all objects have been analysed and 
the appropriate corrections have been applied. 

We use the hierarchical peak finding algorithm from 
Kaiser et al. (1995) to find objects with a significance > 5a 
over the local sky. These are analysed, which yields estimates 
for their sizes, magnitudes and shapes. As described in HFK 
we also estimate the error on the shape measurements, which 
allows a proper weighting of the sources. 

The resulting catalogs are inspected visually, and spu- 
rious detections, such as diffraction spikes, HII regions in 
resolved galaxies, etc. are removed. 

We then identify the objects that are detected in both 
the F606W and F814 images. For these we determine 
colours using the same aperture for both filters. The aper- 
ture that is used scales with the Gaussian scale length rg of 
the object. This results in a sample of 2155 objects, both 
galaxies and stars, with a corresponding number density of 
81 objects arcmin"^. The objects that are detected in only 
one filter are small, and faint, and as a result not useful for 
the weak lensing analysis. 
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Figure 2. Plot of the apparent magnitude in the F606W filter (a) and the i^814iy filter (b) versus half light radius r^. Because of the 
low galactic latitude of MS 2053 many stars are found, which correspond to the vertical sequence of points at ~ O'.'l. 



The magnitudes are zero-pointed to Vega, using the zero 
points given the HST Data Handbook (Voit et al. 1997). 
Figure ^ shows the plot of the apparent magnitude versus 
the object half light radius of the detected objects. 

Because of the low galactic latitude of MS 2053 many 
stars are found in the observed field. These are located in the 
vertical sequence of points at a half-light radius rh ~ O'.'l. 
The brightest stars saturate and have larger half light radii. 
Based on Figure ^ we select a sample of 198 moderately 
bright stars. These stars are used to study the PSF, and the 
results are used to correct the shapes of the faint galaxies 
for PSF anisotropy and the size of the PSF as described in 
HFKS. 

The observed polarizations in the FSl'iW images of 
these stars are presented in Figure ^ HFKS studied the 
WFPC2 PSF using observations of the globular cluster M4, 
and the pattern observed here is similar to the one presented 
in HFKS. 

The PSF changes slightly with time, and subtraction 
of the M4 model from the observations leaves systematic 
residuals. To improve the model for the PSF anisotropy we 
fitted a modified model to the shape parameters of the stars 
in the MS 2053. It is a scaled version of the M4 model with 
a first order polynomial added: 

new M4 , , , 

Pi = a- Pi -I- Co -I- cia; + C2y. 

This model fits the observed PSF anisotropy of stars in the 
MS 2053 field well (the reduced of the fit is 0.98 for 179 
stars). 

The next step is to determine the "pre-seeing" shear po- 
larizability P'' (Luppino & Kaiser 1997; HFKS). The mea- 
surements of P'' for individual galaxies are rather noisy, and 
therefore we bin the measurements as a function of the Gaus- 
sian scale length rg. 

Because of the poor sampling of WFPC2 images only 
the shapes of galaxies with size rg > 0'.'08 can be corrected 
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Figure 3. Observed polarization of stars selected from the 
F814W images. The sticks indicate the direction of the major 
axis of the PSF, as well as the size of the polarization. The po- 
larizations are measured using a Gaussian weight function with a 
dispersion of O'.'OT. The lower left panel corresponds to chip2, the 
lower right to chip 3, and the upper right one denotes chip 4. We 
have omitted chip 1, which is the planetary camera. 

reliably^. We select objects that have rg > O'.'OS and remove 
saturated stars from the catalogs. After this selection the 
sample of galaxies consists of 1540 galaxies analysed from 
the FQ06W images, and 1545 from the FSliW images. We 



that is 

a factor \/2 too large. This affects all objects, and therefore the 
limit listed here corresponds to the 0'.'12 given by HFKS. 
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Figure 4. (a) Colour-magnitude diagram of the galaxies in the 
full mosaic for which colours have been determined, (b) Colour- 
magnitude diagram for galaxies within 1 arcminute from the 
BCG. The cluster colour-magnitude relation is better visible in 
this diagram, although it is not as obvious as for other rich clus- 
ters. The line indicates the assumed cluster colour-magnitude re- 
lation. 



note that because of this cut not all objects appear in both 
catalogs anymore. 

Finally the shapes are corrected for the camera distor- 
tion, and the catalogs are combined into a master catalog. 
We use the estimated errors on the shape measurements to 
combine the results from the _F606VI^ and _F814W^ images in 
an optimal way. The resulting catalog includes 1677 galax- 
ies, which corresponds to a number density of 63 galaxies 



4 LIGHT DISTRIBUTION 

Figure ^ shows the colour of the galaxies in the full mo- 
saic versus their _F814W^ magnitude. Compared to the other 
two clusters for which we obtained HST mosaics, MS 2053 
is less obvious from the optical images. As a result the con- 
trast of the cluster colour-magnitude relation with the back- 
ground is lower. However, in the diagram for galaxies within 
1 arminute from the BCG the cluster colour-magnitude re- 
lation can be discerned (fig. ^). 

To estimate the hght contents of the cluster we define a 
sample of cluster galaxies as follows. Down to ^81414^ = 21 
we select spectroscopically confirmed cluster members (Van 
Dokkum et al. 2001, in preparation). At fainter magnitudes 
we use the colour-magnitude relation drawn in Figure ^b. 
We select galaxies with -0.4 < A{F606W - FSUW) < 0.2 
mag relative to the cluster colour-magnitude relation. To 
correct for contamination by field galaxies we subtract the 
counts from the Hubble Deep Fields north and south. The 
smoothed luminosity distribution of this sample is presented 
in Figure In Figures ^ and c grey scale images of the 



smoothed number density of bright (19.5 < FSIAW < 23) 
and faint (23 < F814W^ < 25) galaxies are presented. 
Around the position of the BCG a significant overdensity 
of galaxies is detected. Figure ^ shows most clearly that 
the light distribution is elongated in the direction where the 
arc is found (Luppino & Gioia 1992) . Figure |^ still shows 
an overdensity at the position of the cluster. Interestingly, 
it also shows a clear overdensity south of the cluster. The 
overdensity is caused by galaxies bluer than the cluster, but 
it is not clear whether they belong to another cluster along 
the line of sight. 

We estimate the cluster luminosity in the rest frame 
B band. To do so we use template spectra for a range in 
spectral types and compute the corresponding pass band 
correction (this procedure is similar to the method described 
in van Dokkum & Franx 1996). Thus we find the following 
transformation from the HST filters to the rest frame B 
band: 

B, = _F'814W^ + 0.47(F606W^ - F8UW) + 0.75, 

where B^ denotes the corrected B band magnitude. The 
luminosity is given by 

^ IqOA{Mbq-B,+DM+Aps14w) i^j^^ 

where Mbq = 5.48 is the solar absolute B magnitude, DM 
is the distance modulus, and Afsiaw is the extinction cor- 
rection in the F8UW filter towards MS 2053. The red- 
shift oi z = 0.58 for MS 2053 gives a distance modulus 
of 43.14 — 51og/i5o. We use the dust maps from Schlegel, 
Finkbeiner, & Davis (1998) to correct for the galactic ex- 
tinction. Because of the low galactic latitude of MS 2053, 
we find a rather high value of Afsi4w = 0.15. We have 
used SExtractor (Berlin & Arnouts 1996) to determine to- 
tal magnitudes for the galaxies. 

The cumulative light profile as a function of distance 
from the cluster centre is presented in Figure ^ The to- 
tal luminosity within an aperture of radius 1 h'^Q Mpc is 
(3.1 ± 0.4) X W^^h-ohBo- The error in the luminosity re- 
flects the uncertainty in the determination of cluster mem- 
bership, and the total magnitudes measured by SExtractor. 
We note, however, that the error is small compared to the 
uncertainty in the weak lensing signal (section 4 and fur- 
ther). At large radii the profile is rather steep. To compute 
the profile, we average the light distribution in circular bins. 
If the light distribution is elongated as suggested by Fig- 
ure 1^ this leads to an overestimate of the light at large 
radii, where the coverage is incomplete. 



5 WEAK LENSING SIGNAL 

Each galaxy gives only a noisy estimate of the weak lensing 
signal because of its intrinsic shape. Therefore we average 
the shape measurements of many sources to obtain a useful 
estimate of the distortion g. When we compute the ensem- 
ble averaged distortion, we weight the contribution of each 
object with the inverse square of the uncertainty in the mea- 
surement of the distortion as described in HFK. 

We select galaxies with 21.5 < F814W < 25.5, and 
< F6Q6W - F8UW < 1.4 in our sample of back- 
ground galaxies. As mentioned above, we exclude objects 
with sizes comparable to the PSF. The resulting sample 
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Figure 5. (a) Smoothed luminosity distribution from the sample of cluster galaxies. The definition of the sample is described in the 
text. The intervals between subsequent contours are lO^L^Qpc"'^. The position of the BCG has been used to define the cluster centre, 
and corresponds to the origin of this figure. The overdensity at the cluster centre is clearly visible, (b) Smoothed number density of 
bright galaxies (19.5 < F814W < 23). (c) smoothed number density of faint galaxies (23 < F814W < 25). At the position of the cluster 
a small overdensity is still visible. Another overdensity of faint galaxies is visible to the lower left of the cluster. The number density 
distributions have been smoothed using a Gaussian with a FWHM of 45" (indicated by the shaded circle). In figures (b) and (c) the 
intervals between subsequent contours are 5 galaxies arcmin"^. 



consists of 1130 galaxies, and has a median magnitude of 
F814:W = 24.3. Comparison with Figure ^ shows that some 
faint cluster members might end up in this sample of back- 
ground sources. The contamination will be most important 
in the central region of the cluster. To estimate the contam- 
ination by cluster members we examined the azimuthally 
averaged number density as a function of distance from the 
cluster centre. The profile is presented in Figure ^. The num- 
ber counts are slightly higher near the cluster centre, but the 
excess is not significant. In the further analysis we ignore the 
data inside 40" from the BCG. 

The average number density of sources is 43 galax- 
ies arcmin"^. Similar number densities can be reached in 
deep images taken from the ground (e.g., Bezecourt et al. 
2000). However, the main advantage of our observations over 
ground based images is the much smaller correction for the 
size of the PSF. As a result the lensing signal is better cali- 
brated, and the noise in the shape measurements from HST 
data is lower. 

Figure ^ shows the smoothed distortion field from the 
sample of source galaxies (we used a Gaussian with a FWHM 
of 45"). The position of the BCG corresponds to the origin 
of the plot. A systematic tangential alignment of the sources 
with respect to the cluster centre can be observed. 

We use the distortion field presented in Figure ^ to 
reconstruct the projected surface mass density. The mass 
reconstruction has been computed using the maximum like- 
lihood extension of the original KS algorithm (Kaiser & 
Squires 1993; Squires & Kaiser 1996). This algorithm has 
the advantage over direct inversion methods that it can be 
applied to fields with complicated boundaries, such as our 
mosaic. 

Figure Hb shows a grey scale image of the reconstructed 
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Figure 6. The cumulative restframe B band luminosity as a func- 
tion of radius from the cluster centre. The profile is calculated 
using the sample of cluster galaxies (selected both spectroscopi- 
cally and based on colour) . The arrow indicates a radius of 1 h'^^ 
Mpc. 



surface mass density. The peak in the mass distribution co- 
incides with the position of the BCG. A bootstrapping re- 
sampling of the shape measurements enables us to compute 
the noise map of the mass reconstruction, which is presented 
in Figure |^. The noise in the mass reconstruction increases 
rapidly towards the edges of the observed field. From the 
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Figure 7. Azimuthally averaged number density of background 
galaxies (21.5 < FSUW < 25.5 and < F606W - FSliW < 1.4) 
as a function of distance from the cluster centre. The counts 
have been corrected for the boundaries of the mosaic. No signifi- 
cant overdensity is found at the cluster position, which indicates 
that the contamination by cluster members is low. The dashed 
line shows the average number density of 43 background galaxies 
arcmin."^ 

noise map we find that the peak in the mass distribution is 
detected at the 3ct level. 



6 MASS AND MASS-TO-LIGHT RATIO 

The azimuthally averaged tangential distortion (gr) as a 
function of radius from the cluster centre is a useful measure 
of the lensing signal (e.g., Miralda-Escude 1991; Tyson & 
Fischer 1995). The tangential distortion is defined as gr ~ 
— ((?i cos 24>-\-g2 sin 20), where is the azimuthal angle with 
respect to the assumed cluster centre, for which we take the 
position of the BCG. 

The azimuthally averaged tangential distortion as a 
function of radius from the cluster centre is presented in Fig- 
ure ^. A singular isothermal sphere model (^(r) — te/'^t, 
where rs is the Einstein radius) gives a best fitted rs ~ 
6'.'3± I'.'S. To minimize the diluting effect of cluster galaxies 
on the weak lensing signal, we have excluded the measure- 
ments at radii smaller than 40" from the fit. 

6.1 Velocity dispersion 

The next step is to relate the measurement of the Einstein 
radius to a velocity dispersion, for which we use photomet- 
ric redshift distribution from the the northern and south- 
ern Hubble Deep Fields (Fernandez-Soto, Lanzetta, & Yahil 
1999; Chen et al. 1998). HFK examined the usefulness of 
photometric redshift distributions to calibrate the lensing 
signal and found that they work well. 

The amplitude of the lensing signal as a function of 
source redshift is characterized by f3, which is defined as 



/3 = msLx[0, Dis/ Ds], where Dis and Ds are the angular di- 
ameter distances between the lens and the source, and the 
observer and the source. To compute (/3) we also take into 
account that fainter galaxies are noisier and have a lower 
weight in the average. For our sample of sources we obtain 
{/?) = 0.29 (taking Q„, = 0.3, and = 0). Placing the 
background galaxies in a single source plane at z = 1 would 
yield a similar {[3). Using these results we derive a velocity 
dispersion of a = 886l]^3g km/s. 

The value of {/?) does not only depend on the redshifts 
of the sources, but it also depends on the cosmological pa- 
rameters that define the angular diameter distances. For an 
Qrn = 1, and Qa ~ model we find essentially the same 
(/?), and a = 88ll5^3g km/s. In a Qa dominated universe the 
changes are larger. Assuming Urn = 0.3, and Qa ~ 0.7 gives 
{/?) = 0.33, and results in a = 83lligQ km/s. 

The result from the weak lensing analysis is in excellent 
agreement with the observed velocity dispersion of 817 ± 80 
km/s, which was determined from the velocities of 52 cluster 
members (Van Dokkum et al. 2001, in preparation). 

Clowe (1998) obtained deep R band images of MS 2053 
with the Keck telescope, and measured the weak lensing sig- 
nal. He used a source redshift of z = 1.75 and derived a ve- 
locity dispersion of a ~ 700 km/s. Such high average source 
redshifts are unrealistic, in particular when compared to the 
z = 1 we use, based on photometric redshift distributions. 
For more realistic redshift distributions the result of Clowe 
(1998) increases to cr ~ 900 km/s, in good agreement with 
our results. 

Luppino & Gioia (1992) discovered a gravitationally 
lensed blue arc in deep images of MS 2053. The arc is located 
approximately 15 arcseconds north of the BCG. To date, the 
redshift of the arc is not known. If we assume a redshift of 
z = 2 for the arc, and adopt a SIS model (where the position 
of the arc gives the Eistein radius) the corresponding veloc- 
ity dispersion is about 1030 km/s. This value is higher than, 
but consistent with the weak lensing estimate. Moreover, if 
the mass distribution is elongated in the direction of the arc 
the strong lensing mass estimate is lowered. Figure |^ in- 
dicates that the light distribution is elongated, roughly in 
the direction of the giant arc. Because of the low signal-to- 
noise ratio of the weak lensing signal, we cannot constrain 
the elongation of the mass distribution. 

6.2 Mass-to-light ratio 

From the sample of cluster galaxies we estimate a total clus- 
ter luminosity of (3.1 ± 0.4) x 10^^ h^ghsQ within an aper- 
ture of radius 1 h'^g Mpc (see section 4). The best fit SIS 
model gives a projected mass of (5.7 ± 1.6) x IO^^/i^q^M© in 
the same aperture. Thus we obtain an average mass-to-light 
ratio of 184 ± 56 /iso Mq /Lb© within 1 h^g Mpc. 

Kelson et al. (1997) have studied the fundamental plane 
of MS 2053. They find that the early type galaxies in the 
cluster define a clear fundamental plane. Comparison with 
low redshift clusters suggests that the structure of early type 
galaxies has changed little since z — 0.58. Similar analyses 
have been performed for other clusters (e.g. van Dokkum & 
Franx 1996; van Dokkum et al. 1998) and have shown that 
the mass-to-light ratios of early type galaxies evolve with 
redshift, which is accounted to luminosity evolution. 

As a result also the global cluster mass-to-light ratios 
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Figure 8. (a) Smoothed distortion field g obtained using galaxies with 22 < F814VK < 26 and < F606VK - F814W < 1.6. The 
measurements have been smoothed using a Gaussian with a FWHM of 45 arcseconds (indicated by the shaded circle). The orientation 
of the sticks indicates the direction of the distortion, and the length is proportional to the amplitude of the signal. The origin of 
the plot coincides with the assumed cluster centre, (b) The corresponding reconstruction of the projected surface mass density. The 
interval between adjactent contours is 0.05 in k. (c) Noise map of the mass reconstruction from bootstrapping resampling of the shape 
measurements. It shows that the noise in the reconstruction increases rapidly towards the edges of the observed field. The interval 
between adjacent contours is 0.025. The peak in the mass reconstruction is detected at the 3cr level. 



evolve with redshift. The mass-to-light ratio of early type 
galaxies in MS 2053 in the B band is 37 ± 4% lower than 
present day values (Kelson et al. 1997). Under the assump- 
tion that the total luminosity of the cluster has changed by 
the same amount, we find an average mass-to-light within 
1 h^Q Mpc of 291 ±89 ±19 hso Mq/Lb©, corrected for lumi- 
nosity evolution to z = 0. The first contribution to the error 
budget is the statistical uncertainty in the determination of 
the mass-to-light ratio, and the second contribution is due 
to the uncertainty in the correction for luminosity evolution. 

Under the assumption that the light traces the mass 
we can derive the expected tangential distortion as a func- 
tion of radius. To measure the mass-to-light ratio, we scale 
the computed tangential distortion to match the ob- 
served signal. In Figure |lO^ the resulting profile (solid line) 
is shown. The ratio of the computed and observed signal is 
presented in Figure [ic|b. Because of possible contamination 
by faint cluster members we exclude the points at radii less 
than 40" from the fit. We find that the results are consistent 
with a constant mass-to-light ratio with radius, and we find 
an average value of 195 ± 58 hso Mq /Lb© . 



7 COMBINED RESULTS FROM RICH 
CLUSTERS 

With the analysis of MS 2053 we have a sample of three 
clusters of which the mass distribution has been studied us- 
ing mosaics of WPFC2 images. For the analysis in section 
7.3, we augment this sample with the z — 0.22 cluster Abell 
2219, which has been studied from the ground by Bezecourt 





z 


(2-10 keV) 


kT 


ref. 






10« ergs/s] 


[keV] 




A 2219 


0.22 


38 


9.5 ± 0.6 


1 


CI 1358+62 


0.33 


11.4 ± 0.3 


6.9 ± 0.5 


2 


MS 2053-04 


0.58 


7.9 ± 0.7 




2 


MS 1054-03 


0.83 


28.6 


1 2 4+'^--'- 
J-^.1_2.2 


3 



Table 1. X-ray properties of the three clusters for which we have 
obtained HST mosaics, as well as the properties of A 2219. Ref- 
erences: (1) Allen (1998); (2) Henry (2000); (3) Donahue et al. 
(1998). 



et al. (2000). Their analysis is identical to ours, and like for 
our clusters, photometric redshifts have been used to relate 
the lensing signal to the mass. 

All clusters are X-ray selected, and their X-ray proper- 
ties are listed in Table ^ 

7.1 Comparison between weak lensing mass and 
dynamical mass 

The large number of spectroscopic confirmed members in 
each of the remaining three clusters results in accurate mea- 
surements of their galaxy velocity dispersions. In this section 
we compare the weak lensing estimates of the cluster veloc- 
ity dispersions to the velocity dispersion of the galaxies. In 
Table H we list the results of the best fit SIS model to the 
observed weak lensing signal, as well as the corresponding 
velocity dispersion inferred from lensing. 
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Abell 2219 


z 
0.22 


rs 
["] 

17.4 ± 2.0 


(/3> 
0.46 


cr (WL) 
[km/s] 
10751^1 


a (galaxies) 
[km/s] 


ref. 


CI 1358+62 


0.33 


10.8 ± 1.4 


0.56 


835+fe 


910 ± 54 


1 


MS 2053-04 


0.58 


6.3 ± 1.8 


0.29 


8861121 


817 ±80 


2 


MS 1054-03 


0.83 


11.5 ± 1.4 


0.23 


13111*89 


1150 ±90 


3 



Table 2. Results from the weak lensing analyses of the three clusters for which HST mosaics were obtained. References: (1) Carlberg et 
al. (1997); (2) Van Dokkum et al. (2001, in preparation); (3) van Dokkum (1999). 



0.2 




200 




Figure 9. (a) Average tangential distortion as a function of ra- 
dius from the cluster centre, for which we took the position of 
the brightest cluster galaxy. The line corresponds to the pro- 
file of a singular isothermal model («(?") = r£/2r, where te is 
the Einstein radius) fitted to the data. Because of possible con- 
tamination by faint cluster members we only fit to the data at 
radii larger than 40" . The best fit value for the Einstein radius is 
= 6'.'2 ± 1'.'8. (b) average signal when the phase of the distor- 
tion is increased by 7r/2. If the signal shown in (a) is caused by 
gravitational lensing, {gx) should vanish, as is observed. In both 
figures, the arrows indicate a radius of 1 Kz} Mpc. 



Figure 10. (a) Plot of the average tangential distortion as a 
function of radius from the BCG. The solid line is the expected 
tangential distortion (scaled by the mass-to-light ratio to fit the 
observations) derived from the average radial light profile, assum- 
ing the mass-to-light ratio is constant with radius, (b) The ratio 
of the observed distortion and the derived distortion from the 
light (taking M/Lg = 1 in solar units). The shaded region indi- 
cates the one cr region around the average of the points with radii 
larger than 40". The observations are consistent with a constant 
mass-to-light ratio of 196 ± 58 /150 Mq/Lb0. In both figures, the 
arrows indicate a radius of 1 h'^^ Mpc. 



For all three clusters we use the photometric redshift 
distribution from the Hubble Deep Fields (Fernandez-Soto 
et al. 1999; Chen et al. 1998). This lowered the {(5) for 
CI 1358 slightly compared to the value used in HFKS, and 
the new estimate for the weak lensing velocity dispersion is 
listed in Table ^. 

The comparison with the velocity dispersions of cluster 
galaxies shows a good agreement in all three cases, suggest- 
ing that the galaxy velocity dispersions are characteristic of 
the cluster as a whole. Similar comparisons have been made 
in the past (e.g.. Small et al. 1997; Wu et al. 1998; Allen 
1998). The samples of clusters used by Wu et al. (;i998), 
and Allen (1998) are rather inhomogeneous: different meth- 
ods were used for the correction of the circularization of the 



background sources, and no realistic redshift distributions 
of the sources have been used. Wu et al. (1998) and Allen 
(1998) find in general a fair agreement between the veloc- 
ity dispersions of the galaxies and the velocity dispersions 
derived from weak lensing analyses. 

A more systematic study was presented in Small et al. 
(1997) who determined the weak lensing signal of 12 distant 
clusters observed with WFPC2. Each cluster was observed 
with one pointing on the cluster core. 

Small et al. (1997) computed the average tangential dis- 
tortion within an annulus 120 < r < 400 K^q kpc, and 
plotted the result against the observed cluster dispersion. 
Under the assumption that the cluster mass distribution is 
described by a SIS model, this measurement provides an es- 
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Figure 11. Weak lensing velocity dispersion (as inferred from the 
best fit SIS model) versus the observed velocity dispersion of the 
galaxies. The large open points are the results from our HST mo- 
saics, measuring the distortion in an annulus 120 < r < 400 h'^Q 
kpc. The results from Small et al. (1997) are measured in the 
same annulus and are indicated by the small dots. The large solid 
points correspond to the measurements when the complete HST 
mosaics are used. The line of equality between the weak lensing 
velocity dispersion and the velocity dispersion of cluster galaxies 
is indicated by the dashed line. Our large field weak lensing re- 
sults agree well with this line. To allow a direct comparison with 
Figure 4 from Small et al. (1997) we have indicated the average 
tangential distortion in an annulus 120 < r < 400 h'^^ on the 
right hand axis. 



timate for the velocity dispersion. Figure ^ shows the weak 
lensing estimate of the velocity dispersion versus the velocity 
dispersion of cluster members for our sample. 

The large open points are the results from our HST 
mosaics, when we confine the weak lensing analysis to the 
annulus used by Smail et al. (1997). The large solid points 
show the results when the full HST mosaic is used for the 
weak lensing analysis. The right hand vertical axis displays 
the value of the average tangential distortion in the annulus 
used by Smail et al. (1997), and allows a direct comparison 
with their Figure 4. For comparison, we also show the results 
from Smail et al. (1997) (small dots). 

Smail et al. (1997) found a discrepancy between their 
weak lensing signal and the velocity dispersion of the galax- 
ies. They argued that the velocity dispersions from the 
galaxies are overestimated by ~ 40%, compared to the ve- 
locity dispersion expected from the weak lensing analysis. 
When we confine the weak lensing analysis of our HST mo- 
saics to the same annulus used by Smail et al. (1997) we 
too find that the velocity dispersion inferred from the weak 
lensing analysis is lower than the velocity dispersion of the 
galaxies. However, the results based on the full mosaics agree 
well with the line of equality (dashed line). 

Limiting the weak lensing analysis to the cluster core 
results in a systematic underestimate of the cluster veloc- 
ity dispersion. The largest change is seen for MS 1054. For 
this cluster the explanation is straightforward. HFK showed 



that the mass distribution in the cluster centre is complex, 
consisting of three distinct clumps. As a result the average 
tangential distortion is lowered, when only the inner 400h^Q 
kpc are considered in the analysis. 

Several effects can introduce a systematic offset between 
the weak lensing results and the dynamical measurements 
(e.g., Smail et al 1997). Because we find a good agreement 
between the two estimates when the lensing signal is mea- 
sured from wide field data, we argue that it is the cluster 
mass profile in the core that gives rise to the discrepancy. 
Only if the profile is isothermal, one expects a good agree- 
ment, but substrucure or a shallower density profile results 
lowers the lensing signal compared the value expected from 
the SIS model. 



7.2 Average cluster mass profile 

Numerical simulations have indicated that dark matter halos 
originating from dissipationless collapse of density fiuctua- 
tions may follow a universal density profile (e.g., Navarro, 
Frenk, & White 1997). The Navarro, Frenk, & White (NFW) 
profile appears to be an excellent description of the radial 
mass distribution in these simulations. The NFW profile is 
given by 



p{r) 



ScPc 



(r/rs)(l + r/rs 



(1) 



where pc is the critical density of the universe, 5c is the 
characteristic overdensity, and is the scale radius given by 
Ts ~ r2oo/c, which all depend on the redshift and mass of 
the halo. The parameter c is referred to as the concentration 
parameter. Given the cosmology, redshift, and mass of the 
halo, r2oo follows immediately, and the values of Sc, and c 
can be computed using the routine CHARDEN made available 
by Julio Navarro^. 

We have fitted the predicted profiles from NFW halos 
to the observed tangential distortion of each cluster, and 
the best fit parameters are listed in Table ^ We have used 
a value of F = 0.18 for the shape parameter of the CDM 
power spectrum. The parameter that we fitted is M200, the 
mass enclosed within a sphere of radius r^oo, and the other 
parameters are the ones produced by CHARDEN given M2oo- 
The errors for r2oo, c, and listed in Table ^jonly reflect the 
uncertainty in these parameters because of the uncertainty 
in the measurement of M2oo- We note that the resulting 
parameters are mainly determined by the amplitude of the 
lensing signal (i.e. the mass of the halo) and not by the shape 
of the density proflle. Because of the strong substructure in 
the centre of MS 1054 we excluded the measurements at 
radii less than 75 arcsec. 

We now examine whether the NFW predictions match 
the actual observations. To do so, we scale the amplitude of 
the tangential distortion profiles of the four clusters to the 
signal of a cluster with M200 of 5 x W^'^h^^ Mq at a redshift 
z = 0.5 (where we placed the sources at infinite redshift), 
and scale the data radially in units of the derived value of 
T-s (listed in Table ^, the scale length of the NFW profile. 
The resulting ensemble averaged tangential distortion as a 



t The 



routine 



CHARDEN 



tittp : //pinot .phys .uvic . ca/ ~jf n/ charden 



be obtained 



from 
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(1) (2) (3) (4) (5) (6) (7) (8) (9) (10) 

z M200 r-200 c Ts Xnfw > Xnfw) ^lis ^(^^ > ^Sis) 

[lO^h^-i Mq] [ftg-i Mpc] [hg-i kpc] 

A 2219 0.22 I6.6I3 2 2S3to,ll S.Oltp " 524^4^ 8.6 0.74 10.0 0.62 

CI 1358+62 0.33 2.8^0.8 l-SSto.M 5.86^0 231^3^ 15.1 0.24 15.7 0.20 

MS 2053-04 0.58 2.8^1-5 ^-^^to.lr ^■'^^toU 217^61 8.6 0.74 9.6 0.65 

MS 1054-03 0.83 14.4^2.6 l-Sl^aw 4.46to.o8 ^07^34 11.3 0.13 11.5 0.12 



Table 3. Best fit parameters for the NFW profile (columns 3-6). The fitted paramter is M200 and the other parameters are computed 
using CHARDEN, for an OCDM cosmology with flm = 0.3, Oa = Oj a^nd shape parameter F = 0.18. The errors indicate the 68% confidence 
intervals. Columns 7 and 9 give the foi" the best fit NFW profile, and SIS model, respectively. The probabilities of finding a larger 
value for are listed in columns 8 and 10. 



function of is presented in Figure |l2|a. This figure also 
shows the best fit SIS model (dashed line) and the best fit 
NFW profile (solid line). 

The NFW profile provides a good fit to the data (x^ = 
12.8; P{x^ > 12.8) = 0.38). The SIS model fit is worse 
with a = 17.1 (P(x^ > 17.1) = 0.15). The NFW model 
that is fitted to the observations has a concentration param- 
eter that is a X cocdm- Thus we test whether the predicted 
concentration parameters agree with the observed lensing 
signal. Figure 1^ shows Ax^ ~ ~ Xmin as a function 
of a in units of cqcdm. We find that the best fit value is 
0.791q^5 X Cocdm- Thus the predicted concentration pa- 
rameter is in good agreement with the observations. 

In the above, we have used NFW models for which the 
parameters were obtained from the lensing data. Although 
the parameters are essentially determined by the ampli- 
tude of the lensing signal, it is useful to examine this in 
more detail. To do so, we use the observed velocity disper- 
sions of the galaxies to obtain an estimate of r^oo, using 
^■200 = (-\/3crgai)/(10Ji'(2:)) , where H{z) is the value of the 
Hubble parameter at the redshift of the cluster. We omit 
A 2219, because the galaxy velocity dispersion is not known. 
We compute the NFW parameters and scale the tangential 
distortion profiles of the three remaining clusters. The re- 
sulting ensemble averaged profile is compared to the NFW 
profile in the same way as before. We find that the NFW 
profile is a good fit (x^ = 13.4), and that the SIS model fits 
worse (x^ ~ 20.5). The best fit concentration parameter is 
found to be 0.831q'37 x cocdm. Thus both approaches yield 
similar results. 

These results shows that a systematic weak lensing 
study of a number of clusters provides a direct way to test 
consistency of the predictions of the theory of dissipationless 
collapse in CDM cosmologies. 
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Figure 12. (a) The ensemble averaged tangential distortion as a 
function of radius (in units of rs = r200 / c) for the four clusters 
in the samples. We used the concentration parameters listed in 
Table ^. The solid line is the best fit NFW profile, and the dashed 
line is the best fit SIS model. The NFW provides the best fit 
(X^ = 12.8), whereas the SIS model fit is worse (x^ = 17.1). (b) 



X 



^2. 



Ax^ 

and predicted concentration parameter a/cocDM. The best fit 
NWF profile yields an observed concentration parameter that is 
0.79toit X '^OCDM 



7.3 Cluster mass-to-light ratio 

Table ^ lists the estimates of the average mass-to-light ratio 
within an aperture of 1 h'^^ Mpc radius. All values are given 
in the restframe B band. Except for A 2219, the luminosity 
evolution of the early type galaxies has been measured by 
studying the fundamental plane (e.g.. Kelson et al. 1997; Van 
Dokkum et al. 1998.) Under the assumption that the global 
cluster mass-to-light ratio evolves similarly with redshift we 
can correct the observed mass-to-light ratio for luminosity 
evolution to 2: = 0, and the results are also listed in Table m 
The uncertainty in the luminosity evolution results in an 



additional contribution to the total error budget. In Table ^ 
we list the statistical error in the measurement of the mass- 
to-light ratio and the uncertainty due to the correction for 
luminosity evolution separately. 

Figure |l^ shows the observed average mass-to-light ra- 
tio in the B band within an aperture of 1 h'^Q Mpc as a 
function of cluster redshift. The dashed region in Figure |l^ 
corresponds to the average of the observed mass-to-light ra- 
tios (i.e. assuming no luminosity evolution), which yields a 
value of M/Lb = 151 ± I2/150 Mo/Lb©. The observations 
are inconsistent with an unevolving cluster mass-to-light ra- 
tio that does not evolve at the 99% level. 
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z 


M/Lb (obs) 


M/Ls {z = 0) 






[/IBO Mq/Lbq] 


[ftso Mq/Lbq] 


A 2219 


0.22 


210 ± 24 


256 ± 29 ± 6 


CI 1358+62 


0.33 


141 ± 23 


186 ± 30 ± 10 


MS 2053-04 


0.58 


184 ± 56 


291 ± 89 ± 19 


MS 1054-03 


0.83 


124 ± 17 


269 ± 37 ± 31 



Table 4. Average mass-to-light ratio within apertures of 1 h^^ 
Mpc radius. The error budget of the mass-to-light ratios corrected 
for luminosity evolution consists of the statistical uncertainty in 
the measurement of the mass-to-light ratio (first error) and the 
contritbution due to the uncertainty in the luminosity evolution 
(second error). 
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Figure 13. The observed average mass-to-light ratio within a 
1 /tjTjj"' Mpc radius aperture of the clusters in the sample as a 
function of rcdshift. The shaded region indicates the Icr region 
around the average mass-to-light ratio (assuming no luminosity 
evolution). The assumption of an uncvolving mass-to-light ratio is 
excluded at the 99% confidence level. The solid line corresponds to 
the luminosity evolution as a function of redshift as inferred from 
studies of the fundamental plane of distant clusters of galaxies 
(Van Dokkum et al. 1998), and is consistent with the observations. 



The solid line corresponds to the luminosity evolution 
as a function of redshift as inferred from studies of the fun- 
damental plane of distant clusters of galaxies (Van Dokkum 
et al. 1998), scaled to fit the observed total cluster mass-to- 
light ratios. The evolution of the cluster mass-to-light ratio 
of X-ray selected clusters is consistent with the evolution 
of the mass-to-liglit ratio of the early type galaxies. Van 
Dokkum et al. (1998) found that the M/Lb ratio evolves as 
A log M/Ls oc (— 0.40 ± 0.04)a, which results in an average 
value of M/Lb = 239 ± 18 ± 9^50 M© /Lb© for clusters at 
z = Q. The first error indicates the statistical uncertainty 
in the measurement of the mass-to-light ratio, and the sec- 



ond error indicates the additional error introduced by the 
uncertainty in the luminosity evolution. 

Carlberg et al. (1997) analysed a sample of 16 rich clus- 
ters, and also found that the cluster mass-to-light ratios are 
consistent with a universal value. They found an average 
value oi M / Lr — 119 ± 21 h^oMQ /Ltq. To convert this 
value to a mass-to-light ratio in the B band, we assume an 
average colour of the cluster of S — r = 1.07, which corre- 
sponds to the typical colour of SO galaxies (j0rgensen et al. 
1995). Thus we find that the estimate for the average cluster 
mass-to- light ratio from Carlberg et al. (1997) corresponds 
to 219 ± 38 hso M© /Lb© (where we also corrected for lumi- 
nosity evolution to « = 0), in excellent agreement with our 
results. 

Given the small spread in cluster mass-to-light ratios 
the star formation efficiency in rich clusters appears to be a 
well regulated process, although the sample of clusters needs 
to be increased before firmer conclusions can be drawn. 



8 CONCLUSIONS 

We have presented the results of our weak lensing analysis of 
MS 2053-04, a cluster of galaxies at a rcdshift z — 0.58, for 
which we detect a clear lensing signal. It is the third cluster 
we have studied using a two-colour mosaic of deep WFPC2 
images. Previously wc have studied CI 1358-1-62 (z = 0.33; 
HFKS) and MS 1054-03 (z = 0.83; HFK). 

The selected sample of background sources (with 21.5 < 
^814^^ < 25.5 and < F606W-F8UW < 1.4) has a num- 
ber density of 43 galaxies arcmin"^. Similar number densi- 
ties can be reached in deep ground based observations, but 
the correction for the circularization by the PSF in WFPC2 
images is much smaller. As a result the lensing signal can 
be measured more accurately from space based images. 

The position of the peak in the reconstruction of the 
cluster mass surface density agrees well with the peak in the 
light distribution. To measure the mass of the cluster we 
fit a SIS model to the observed azimuthally averaged tan- 
gential distortion. The corresponding value for the Einstein 
radius is te = 6'.' 2 ± I'.'S. To relate the Einstein radius to an 
estimate of the cluster velocity dispersion, we use published 
photometric redshift distributions inferred from the north- 
ern and southern Hubble Deep Fields. The best fit SIS model 
corresponds to a velocity dispersion of u = 886l|^3(j km/s, 
which is in excellent agreement with the observed velocity 
dispersion of cluster galaxies of 817 ± 80 km/s. 

We have analysed the weak lensing signal of 3 clusters 
using wide field HST data, and we find that the velocity 
dispersion derived from weak lensing agrees well with the 
velocity dispersion of the cluster galaxies. This result differs 
from Small et al. (1997) who compared the weak lensing sig- 
nal to the galaxy velocity dispersion using HST observations 
of cluster cores. Based on our results we argue that the dis- 
crepancy is caused by deviations from the SIS model in the 
inner regions of clusters (substructure or a fiatter profile). 
To obtain an accurate estimate of the weak lensing velocity 
dispersion wide field data axe necessary. 

We use a sample of 4 clusters that have been analysed 
uniformly to study the average cluster profile. The NEW 
profile fits the ensemble averaged lensing signal well, and 
the predicted concentration parameter is in good agreement 
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with the observations: the observed value is found to be 
0.79lg 15 times the predicted value for an OCDM model. 

The observed average mass-to-light ratio of MS 2053 
within a 1 /i^q^ Mpc radius aperture is 184±56 ftso Mo/Lb©. 
We have examined the mass-to-light ratios of the clusters in 
our sample, and find that the results are inconsistent with 
a non-evolving universal mass-to-light ratio. The measure- 
ments are consistent with a universal mass-to-light ratio for 
rich, X-ray selected, clusters of galaxies which evolves with 
redshift similarly to the luminosity evolution of the clus- 
ter galaxies (e.g.. Kelson et al. 1997; van Dokkum et al. 
1998). The average cluster mass-to-light ratio, corrected to 
2 = 0, is found to be M/Lb = 239±18±9 h^o /150 Mq/Lb© 
(where the first error indicates the statistical uncertainty in 
the measurement of the mass-to-light ratio, and the second 
error is due to the uncertainty in luminosity evolution), in 
good agreement with the results from Carlberg et al. (1997) 
based on a dynamical study of 16 rich clusters. The small 
spread in cluster mass-to-light ratios suggests that the total 
star formation in clusters is a well regulated process. 
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